Critiquing the Tileworld: Agent Architectures, Planning Benchmarks, and Experimental Methodology
نویسندگان
چکیده
AI Planning for many years was concerned with solving problems in small, controlled domains like the Blocksworld, and testing the algorithms on small, pathological problems like the Sussman anomaly. Lately the eld has taken on two new goals: to apply planning techniques to more realistic worlds, and to nd a better way to validate the research eorts. One solution that is gaining popularity is to provide more-or-less realistic simulated worlds, in which one implements a planner and provides experimental results re BLOCKINecting its performance. The Tileworld system, reported in [Pollack and Ringuette 1990], confronts three interesting issues: what constitutes a good simulated testbed world for planning research, what constitutes a successful implementation of a planning architecture, and what constitutes experimental validation of such an implementation. This paper examines the program, claims, and experimental results contained in the Tileworld paper, and oers comments about the dicult issues facing a eld that is trying to apply its methods to more realistic problems and simultaneously trying to nd more rigorous and satisfying ways of measuring its progress.
منابع مشابه
Benchmarks, Testbeds, Controlled Experimentation, and the Design of Agent Architectures Experimentation, and the Design of Agent Architectures
The methodological underpinnings of AI are slowly changing. Benchmarks, testbeds, and controlled experimentation are becoming more common. While we are optimistic that this change can solidify the science of AI, we also recognize a set of diicult issues concerning the appropriate use of this methodology. We discuss these issues as they relate to research on agent design. We survey existing test...
متن کاملPlanning Agents in James
Testing is an obligatory step in developing multi-agent systems. For testing multi-agent systems in virtual, dynamic environments, simulation systems are required that support a modular, declarative construction of experimental frames, that facilitate the embeddence of a variety of agent architectures, and that allow an efficient parallel, distributed execution. We introduce the system James (A...
متن کاملA history of the Tileworld agent testbed
This paper looks at the history and development of an agent testbed called Tileworld. It defines the original testbed and documents its gradual development up to its current form. It also introduces some of the experiments performed using Tileworld and their results. It concludes with a comparison of Tileworld with other agent testbeds and a look at agent testbeds as a whole. Tileworld A Histor...
متن کاملBenchmarks, Test Beds, Controlled Experimentation, and the Design of Agent Architectures
The methodological underpinnings of AI are slowly changing. Benchmarks, testbeds, and controlled experimentation are becoming more common. While we are optimistic that this change can solidify the science of AI, we also recognize a set of di cult issues concerning the appropriate use of this methodology. We discuss these issues as they relate to research on agent design. We survey existing test...
متن کاملEmotion as an Enabler of Co-operation
Human reasoning and behaviour is undoubtedly influenced by emotions. However, the role of emotion in reasoning has, until recently, been viewed as secondary, with preference given to game theory principles in order to explain how the reasoning of an individual affects sociable interaction and the phenomenon of cooperation. Despite this, development of emotional agent architectures has gained in...
متن کامل